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The all-pairs shortest path problem is the first non-artificial problem for which 
. it was shown that adding crossover can significantly speed up a mutation-only evo- 

I lutionary algorithm. Recently, the analysis of this algorithm was refined and it was 

shown to have an expected optimization time (w. r. t. the number of fitness evalua- 
CJ- tions) of e(n3-25(logn)°-25). 

In contrast to this simple algorithm, evolutionary algorithms used in practice 
usually employ refined recombination strategies in order to avoid the creation of in- 
feasible offspring. We study extensions of the basic algorithm by two such concepts 
, which are central in recombination, namely repair mechanisms and parent selection. 

CJ^ I We show that repairing infeasible offspring leads to an improved expected optimiza- 

tion time of 0(n^'2(log n)'^'^). As a second part of our study we prove that choosing 
parents that guarantee feasible offspring results in an even better optimization time 
^ I of 0{n^ logn). 

Both results show that already simple adjustments of the recombination operator 
can asymptotically improve the runtime of evolutionary algorithms. 
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1 Introduction 



Evolutionary algorithms [H] have been shown to be robust problem solvers for a wide 
range of combinatorial optimization problems that cannot be handled by traditional al- 
gorithmic approaches [M]. They are a premier choice for complex optimization problems 
that are highly non-linear, dynamic, and/or stochastic. Problems that can be observed 
in the real-world have most of these characteristics. Thus, it is desirable to have efficient 
algorithms that can deal with such problems that occur in such highly complex applica- 
tions. In contrast to many algorithmic approaches studied in the traditional theoretical 
computer science literature, evolutionary algorithms have the ability to solve such com- 
plex problems [H EHl [13 ESI SD]- Another main advantage of evolutionary algorithms is 
that they are easy to parallelize [SB] and viewing them in the light of a tremendous num- 
ber of increasing processors on multi-core computers, one can expect that the number of 
applications to interesting real-world problems will get a further boost during the next 
decade. 

Viewing evolutionary algorithms from the classical theoretical computer science per- 
spective, their main disadvantages is that their theoretical understanding lags far behind 
their practical success. However, in recent years, a lot of progress has been made in under- 
standing evolutionary algorithms from a rigorous viewpoint (see the books [21 129] for the 
current state of the art). With this paper we contribute to this line of research. 

One of the important issues when designing successful evolutionary algorithms is to 
choose a suitable representation of possible solutions together with good variation opera- 
tors. For problems from combinatorial optimization, different representations and variation 
operators have been discussed for a wide range of combinatorial optimization problems (see 
e.g. [301 EHl |3T]). Often, variation operators (such as crossover or mutation) are designed 
to produce feasible offspring. For mutation this is easy to achieve, as a mutation operator 
usually only applies a small number of local changes to a given feasible solution. 

However, the design of crossover operators, producing from two feasible solutions a new 
feasible one, is usually more complicated (see e. g. [21] for different crossover operators for 
the traveling salesman problem). Whenever a crossover operator produces an infeasible 
solution, one option is to discard it. However, this typically does not lead to efficient 
methods, as time is wasted on producing infeasible solutions and evaluating them. To 
deal with this situation, one can use repair mechanisms, which produce from an infeasible 
solution a feasible one based on properties of both parents [H] . Another way of dealing 
with the problem of infeasible solutions is to use specific selection methods and/or more 
problem specific crossover operators that are likely to produce promising solutions [5l [25]. 

The goal of this paper is to point out the effect of repair mechanisms and parent selec- 
tion for crossover on the runtime of evolutionary algorithms in combinatorial optimization. 
Analyzing the runtime behavior of evolutionary algorithms has become a major part in 
their theoretical analysis. Based on results for different kinds of pseudo-Boolean functions 
[T2l IT8] , results have been obtained for different kinds of combinatorial optimization prob- 
lems [2H]- Starting with some results for classical combinatorial optimization problems 
that are solvable in polynomial time such as the computation of minimum spanning trees 
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[25] or maximum matchings [TB], different results have been obtained for NP-hard prob- 
lems |27l [151 [221 112] • Analyses discussing the use of crossover operators on problems with 
a bit string representation include [T9| |2T] . A general recent study on the structure of 
crossover-based search is given in [26] . 

One cannot expect to beat the best known algorithms if the problem under consider- 
ation can be solved in polynomial time. With such studies we want to gain new insights 
on how evolutionary algorithms behave on natural optimization problems and identify into 
the important modules that make such algorithms successful. 

Here, we carry out theoretical studies on evolutionary algorithms for the computation of 
shortest paths. Computing shortest paths is one of the basic problems in computer science 
and has already been considered in various theoretical studies of evolutionary algorithms. 
There are different results for the single-source shortest path (SSSP) problem [^ [52] [5]. 

We investigate the all-pairs shortest path (APSP) problem which is a generalization of 
the SSSP problem. We are given a strongly connectecQ directed graph G = (V, E) with 
|\^| = n and \E\ = m and a weight function w : E ^ W that assigns weights to the edges. 
(We distinguish between the weight of a path given by the sum of the weight of all its edges, 
and its length, defined as the number of edges in the path.) The task is to compute from 
each vertex u & V a weight-shortest path to every other vertex v & V \ {u}. Throughout 
this paper, we assume that G does not contain cycles of negative weight, that is, the 
weight function on the edges is conservative. The APSP problem can be solved by the 
Floyd- Warshall algorithm; using appropriate data structures, APSP can be computed in 
time 0{nm + n^ logn) (see, e. g. [2?])- Our goal is not to show that evolutionary algorithms 
perform better than the best problem specific algorithms for this problem. Our aim is to 
study how general purpose algorithms can deal with the APSP problem. In particular, 
we want to examine the usefulness of crossover operators in evolutionary computation. 
By this, we want to further increase the theoretical understanding of crossover and point 
out how slightly different crossover operators change the runtime behavior of evolutionary 
algorithms for the APSP problem. 

We take the APSP problem as a prominent example to show in a rigorous way how 
different crossover operators influence the runtime of evolutionary algorithms (measured 
as the number of fitness evaluations). Recently, it has been shown that the use of crossover 
operators provably leads to better evolutionary algorithms than evolutionary algorithms 
that are just based on mutation [HI |TT]: The runtime for the mutation-and-crossover ap- 
proach is G(n^'^^(logn)°'^^), which is better than the expected optimization time of G(n^) 
of the algorithm just using mutation. In addition, ^35j studied the runtime behavior of ant 
colony optimization for this problem and proved an upper bound of 0{rt' logn). 

We will see that the evolutionary approach examined in this paper solves the APSP 
problem in expected optimization time 0{n^\ogn), which equals the best known bound 
for general purpose algorithms based on ant colony optimization mentioned just above. 

In Section [2] we introduce the algorithm that is subject to our analyses. We obtain 
a total of three variants of this algorithm by considering different crossover operators: 

^Strongly connected means that there exists a directed path between any pair of vertices. 
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the crossover from [HI E] is described in Section crossover with repair is described in 
Section [31 finally, we describe crossover with parent selection in Section HJ 

In the remaining sections we prove runtime bounds for our two new variants of crossover 
for the APSP problem. In particular, we show in Section [572] that our repair mechanisms 
speed up the optimization process to 0(n'^'^(logn)°'^). Furthermore, we show [STTl that 
our parent selection method leads to an optimization time of 0{n^logn). In Section [H] we 
discuss the general structure of the search space that is used in deriving our runtime bounds 
and apply our findings to the all-pairs bottleneck paths problem. Finally, we conclude in 
Section [3 

This paper is an extension of the conference publication [10]. Here, we present full proofs 
and additionally discuss our structural insights in the new discussion section (Section [6]). 

2 Algorithms 

For the APSP problem we examine the population-based approach introduced in [7], where 
each individual in the population is a path. Our goal is to evolve an initial population (a 
set of paths) into a population which contains, for each pair of vertices {u, v) with u v, a. 
shortest path from m to f (w. r. t. to the sum of the weights of the edges). Consequently, 
we measure the fitness of an individual as the weight of the edges that belong to the path. 

Our algorithm, called Steady State GAapsp (see Algorithm [1]), starts with a population 
V := {Pu,v = {u,v)\{u,v) G E} of size \E\, containing all paths corresponding to the 
edges of the given graph G. The variation operators produce, in each iteration, one single 
offspring. 

The Steady State GAapsp decides in each iteration whether the offspring is produced 
by crossover or mutation dependent on a parameter pc of the algorithm. With probability 
Pc a crossover operator is applied to two randomly chosen individuals of V or otherwise 
(with probability 1 —pc) mutation is used to produce the offspring. To make sure that both 
operators, mutation and crossover, are used we require Pc ^ {0, 1}. For all investigations 
in this paper, we assume that pc is chosen as an arbitrary constant, i. e. pc G ]0, 1[. 

The mutation operator takes an individual Px,y from the population and applies se- 
quentially 5* -t- 1 local operations. Here, S is randomly chosen according to the Poisson 
distribution with parameter A = 1. In a local operation, the current path is either length- 
ened or shortened by a single edge. Assume that the current individual represents a path 
Px,y = {x = vo,vi, . . . ve-i, y = ve) from x to y consisting of i edges, and denote by E^{v) 
and E^{v) the set of incoming and outgoing edges of a vertex v in G, respectively. Then 
an edge e = {u,v) G E~{x) U E~^{y) U {{x,Vi), {v£-i,y)} is chosen uniformly at random. 
If e G {(x, f i), (f£_i, y)}, the edge is removed. This means that either the first edge or 
the last edge in the path is removed leading to an individual P^^ or -P[. ,u^_j consisting of 
i — 1 edges. If e G {E^{x) U E^{y)) \ {(a;,fi), (f£_i,?/)}, the edge is added and the path 
is lengthened. Here, a new individual P^^ y or is produced that contains I + 1 edges. 
Note that a local operation applied to a valid path always leads to a new valid solution 
which implies that the mutation operator only constructs solutions which are paths. 
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1 V = {Pu,v = {U, V 

2 while true do 

3 



{u,v) G E}; 



Choose r G [0, 1] uniformly at random; 
if r < pc then 



choose two individuals Px,y and Px',y' from V u. a. r. ; 



perform crossover on P^^y and Px',y' to obtain an individual P^ 



t ) 



else 



choose one individual Px^y uniformly at random from V and mutate Px^y to 
obtain an individual P^^; 

if P!. J is a path from s to t then 

if there is no individual Ps^t ^ V then V = V U {Pst}] 
else if w{Pi,) < wiP,,) then P = (P U {P;,}) \ {P,,,}; 

Algorithm 1: Steady State GAapsp 



Typically, crossover takes two individuals and combines them into a valid path if the 
end vertex of Px^y and the start vertex of Px',y' match. Choosing both individuals uniformly 
at random from P, as it was done in [TJ [11], often does not lead to a recombined offspring 
that represents a path in the given graph. In the next section, we discuss how repair 
mechanisms can lead to more efficient evolutionary algorithms. Later on, in Section HI we 
discuss how selection methods that select promising pairs of individuals for crossover let 
us prove even better runtime bounds. 

The selection operator only accepts individuals that are paths in the graph. In addition, 
it ensures diversity with respect to the different pairs of vertices. For this reason, each 
individual Pu,v is indexed by the start vertex u and the end vertex v. In the selection step 
an offspring is only compared to an individual of the current population that has the same 
start and end vertex. It is ensured that, for each pair of vertices (m, v) with u v, at most 
one individual Pu,v is contained in the population. This implies that the population size 
of our algorithms is always at most n{n — 1). 

For our theoretical investigations, we measure the optimization time of the algorithm by 
the number of fitness evaluations until an optimal population is reached for the ffist time. 
A population is optimal if it contains a a shortest path for each pair of vertices. Finally, 
the term w. h. p. (with high probability) denotes a result that holds with probability at 
least 1 — for some c > independent of n. 



3 Crossover with Repair 

In this section, we present a simple way to increase the success probability of the crossover 
operator used in previous work. This results, as we shall prove rigorously in Section 15. 2[ 
is an optimization time of 0(n^'^(logn)°'^). 

The main reason why previous crossover operators for the APSP problem have a rela- 
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X ) > • • • {x^y^ » • • • { y' 

(a) Repair crossover for the case x' = y. 

^3 <iy--Hi 

(b) Repair crossover for the case y ^ x' . 

Figure 1: Effect of the repair crossover applied to two paths P^^y and Px'.y' for the two 
possible cases of Algorithm |2J 

tively small success probability is the fact that very often the two parent individuals simply 
do not fit together. That is, the end-point of the first is not equal to the starting point of 
the second path. Since this is a rather obvious way of failing, one might think of simple 
solutions. 

One natural way is the following. If the end-point of the first and the starting point of 
the second path are different, we try to bridge this gap by the (if existent, unique) path 
from one point to the other which is contained in our population. If the population does 
not contain such a bridging path, then the crossover operator still fails. This is what we 
shall call crossover with repair. 

Let the crossover operator with repair be defined as follows (see Figure [1] for a depiction 
of the application of the crossover operator). 

Input: P^^y = {x,...,y) and P^>^y> = {x', ...,y') 

1 ii y = x' then 

2 I P^^ = [s = X, . . . ,y = x', . . . ,t = y') merging P^^y and P^'^y' at y ; 

3 else 



if there is a path Py^^' from y to x' in V then 

= (-s = a;, . . . , . . . , x', . . . , t = ?/') merging P^^y, Py^^i and P^'^yi at their 
common endpoints; 
else 

the operator fails and returns a dummy individual with fitness worse than all 
other possible individuals; 



Algorithm 2: Crossover with Repair. 
The individual Py^x' from Line 4 is called repair-path. We refer to this operator as the 
crossover with repair or repair-crossover. 

Note that this operation replaces Lines [5] in Algorithm [H 

For the variant of the Steady State GAapsp using the repair-operator, we obtain the 
following theorem. 

Theorem 1 (Crossover with Repair). For every crossover rate pc G (0,1) there exists a 
positive absolute constant C := C{pc) such that the Steady State GA^psp using crossover 
with repair (Algorithmic has an optimization time of at most Cn^''^{logn)^''^ w. h.p. 
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A proof of this theorem can be found in Section 15.21 As mentioned in the introduction, 
this result shows that the simple repair mechanism improves on the runtime of the Steady 
State GAapsp without repair (which had a runtime of B(n^'^^(logr;,)°'^^)). 



4 Feasible Parent Selection 

The previous section introduced a simple repair mechanism that leads to an optimization 
time of 0(n^'^(log?2)°'^), which is already an improvement over the optimization time of 
e(rz^-2^(logn)°-25) for the Steady State GAapsp in [II]. Nevertheless, the crossover operator 
may still produce solutions that do not constitute paths. This is the case if the start vertex 
of the second individual does not match the end vertex of the first individual and there is 
no individual in V for repair. 

In the following, we want to make sure that the crossover operator constructs feasible 
solutions, i. e. individuals that represent paths. This is done by restricting the parent 
selection for crossover to individuals that match with respect to their endpoints. We 
choose the two individuals for crossover in Line Oof Algorithm [T] using the feasible parent 
selection procedure given in Algorithm Ei 

1 Choose Px^y € V uniformly at random. 

2 Choose Px',y' £ {Pu,v I Pu,v & V/\u = y/\v^x} uniformly at random. 

Algorithm 3: Feasible Parent Selection. 



It chooses the first individual P^^y uniformly at random from the population V and the 
second individual Px',y' uniformly at random among all individuals in V whose start vertex 
equals the end vertex y of Px,y but whose end vertex does not equal the start vertex of Px,y 
Afterwards, in Line [6] of Algorithm [H crossover is performed by concatenation (compare 



the result of this operation in Figure 1(a)). Note that, due to the selection of the two 



individuals, a path from x to y' is constructed, which implies that the crossover operator 
only constructs feasible solutions. 

This selection operator for the two parents lets us prove even better bounds on the 
optimization time than for crossover with repair as follows. 

Theorem 2 (Feasible Parent Selection). For every crossover rate pc G (0,1) there exists 
a positive absolute constant C := C{pc) such that the Steady State GAapsp using feasible 
parent selection (Algorithmic has an optimization time of at most Cn^logn w. h.p. 

We will prove this theorem in Section 15.21 Recall that the bound given equals the best 
known bound for a randomized search heuristic on the APSP problem. 



5 Runtime Analysis 

In this section we introduce notations and mathematical methods which we will use for the 
proofs of Theorem [Hand Theorem H] in the following two sections. It turns out, that in both 
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proofs the analysis follows a common scheme. At the center of the analysis is a stage-wise 
analysis of the optimization process of the Steady State GAapsp- That is, we show that the 
Steady State GAapsp passes through certain stages during its execution until it eventually 
reaches an stage where the population contains only shortest paths. Note that these stages 
are merely a concept to facilitate the runtime analysis of the Steady State GAapsp and do 
not explicitly occur in the definition (Algorithm [T]) of the Steady State GAapsp- 

As in the previous sections, we again assume that we are given a a strongly connected 
directed graph G = (V, E) on n vertices and a conservative weight function w: E M., 
i. e. G does not contain cycles of negative weight. Recall that the weight of a path is 
the sum of the weights of all its edges while its length is simply the number of its edges. 
To avoid confusion, we refer to (weight-) shortest paths in G as weight-minimal, that is, a 
M-f -path P is weight-minimal in G if all w-f -paths in G have at least the same weight as P. 

It turns out that throughout the proofs in this and the following sections we will never 
regard the actual weight of a path in G (although, of course, we will be constantly con- 
cerned with the presence of certain weight-minimal paths in the population). Instead, we 
repeatedly make use of the following observation. 

Lemma 3. Let G = {V, E) be a finite, strongly connected directed graph with a conserva- 
tive, linear weight function w: E ^M. Assume that Pu^^ and P^^v as well as P^ ^ and P^ ^ 
are paths in G, and o denotes the concatenation of two paths. If w{Pl^^) < w{Pu,x) and 
w{P'^^) < w{Px,v) holds, then we can deduce wi^Pl^ ^ ° P'xv) — '^{Pu,x ° Px,v)- 

Proof. For our linear weight function w given by the definition of the APSP we automati- 
cally get 

^^iK,x ° K,.) = HK,x) + HK,.) < HPu,x) + wiPx,.) = wiP^,x O Px,.). 

□ 

Corollary 4. // the concatenation of two paths Pu,x ° Px,v creates a weight- optimal path 
Pu^v, and if the two paths P!^^^ and P^ ,^ are weight- optimal, too, then P^ ,^ o P^^^ creates a 
weight- optimal path P^^. 

Proof. We have w{P^ ^ o P^ ^) < w{Pu,x o Px,v) due to Lemma [3] and also w{Pu,x o Px,v) < 
w{P^ x° P'xv) due to the optimality of Pu^x ° Px,v CH 

Note, that by design our crossover operators concatenates several paths to derive Pu,v 
Consequently, there is an order in which Lemma [3] and Corollary H] can be applied such that 
the property of weight-minimality holds if our crossover operators are applied to weight- 
minimal paths. Regarding mutation we ignore the ability to shrink paths by deleting edges 
at the beginning or end in our analyses. Thus, the application of mutation can also be 
regarded concatenation of two suitable paths. 

In order to define the different stages properly (which we do separately in the following 
two sections for each of the two version of the Steady State GAapsp), we distinguish all 
pairs of vertices in G for which there exists a weight-minimal path of given length. 
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Definition 5 (Vertex Pairs V^). For a G R^, let be the set of all pairs {u,v) G V"^ with 
V ^ w such that among all weight-minimal u-v-paths in G there exists a path of length at 
most a. 

Note that in tlie previous definition the number a can take any (positive) real value 
although the lengths of paths in G is are always integral. We chose this definition for the 
sole reason of producing formally correct proofs which avoid tedious rounding operators. 
For the understanding of these proofs, however, we may think of a as being integral. In 
particular, we have that = V|^j for all a G M^. Moreover, since G is strongly connected 
and a path in G is at most of length n — 1, it holds that 



(1) 



for all a G M"*" with a > n — 1. 

At this point, let us also introduce a probabilistic tool we will repeatedly use in the 
proof of Theorem [1] and Theorem [2j The following Lemma is an adaptation of the Coupon 
Collector argument. It allows us to give a lower bound on the probability of hitting each 
element of a set I using r samples, provided the probability to sample an element that has 
not yet been sampled is bounded below by a positive constant. 

Lemma 6 (Coupon Collector with Dependencies). 

Let I be a finite set, p G (0, 1), and be sequences of events indexed by I such that 



Ft 



A 



> 



p 



0<s<t 



holds for all t eN and all i E I. 

If T is the random variable that denotes the first point in time t G N such that for 
all i E I one of the events . . . , A* has occurred, then it holds for all r G that 



Pr[T > r] < |/| ■ e" 



-pr 



Proof. First, we show the lemma for r G N. For each i E I, let Bi be the event that none 
of the events A'^, . . . , A\~^ occurs. Then 



Al 



0<s<t 



Pr[5i] = Pr [ Pi A* = JJ Pr 

0<t<r 0<i<r 

holds for alH G / and thus 

Pr[5,] < {I-pY < e-P^ 

Since the two events "T > r " and "IJie/ " coincide, we obtain by the Union Bound 
(see H]) that 

Pr[T > r] < ^ Pr[5i] < |/| ■ e'^^ . 

i&I 
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Now, the lemma also follows for arbitrary positive real values of r, 

Pr[T > r] = Pr[T > \r]] < \I\ ■ e~P^''^ < \I\ ■ e'^' 

holds for all r G M+. □ 
In the following two sections, we present the proofs of Theorem [T] and Theorem [21 
that is, we give upper bounds on the optimization times of the Steady State GAapsp 
using the operator "Crossover with Repair" and the operator "Feasible Parent Selection" , 
respectively. Since the proof of Theorem [1] is more involved, we start with the proof of 
Theorem |2] in the next section. 



5.1 The Proof of the Runtime Bound for Feasible Parent Selec- 
tion 

This section is devoted to the proof Theorem |2j For simplicity, whenever we refer to 
the Steady State GAapsp in this section, we assume without further mentioning that the 
operator "Feasible Parent Selection" is applied. As discussed above, we want to analyze 
the behavior of this algorithm in stages. To this end, we say the Steady State GAapsp 
has completed the k-th stage if the population contains a weight-minimal u-v-path for 
every pair of vertices {u, v) for which there exists such a weight-minimal path of length at 
most a{k) in the graph G, where the sequence {a{k)}k(zfq is given by 

a{k) := (3/2)'= (2) 

for all k E N. Clearly, the first point in time when this event happens defines a random 
variable. We call this random variable and say it marks the end of the k-th stage. The 
following definition makes this notion precise. 

Definition 7 (Time T^). For k eN, let a{k) be as defined in ^ and let be the random 
variable that denotes the first point in time such that, for all pairs {u,v) G V^^i^y the 
population of the Steady State GAj^psp contains a weight-minimal u-v-path. 

Observe crucially that, although (u, v) G V^^^^^ implies there exists a weight-minimal 
path of length at most a{k) in G, we only require the existence of any weight-minimal 
■u-f-path in the population of the Steady State GAapsp at time T^. In particular, this 
M-f-path may be arbitrarily long. 

It is clear by the definition of the V^^(^^'s (Definition [S]) that 

To < Ti < T2 < . . . 

holds. Also note that, depending on G, the sets V^^^^^ and V^f^^_^^^ may be equal for some 
values of k. In this case, also the random variables Tk and T^+i coincide, that is, we 
have Tk = Tk+i. 

We have already seen in ([1]) that if a{k) > n — 1, then the population of the Steady 
State GAapsp contains only weight-minimal paths. Since a{n) > n — 1 for all values of n. 
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this implies that at time T„ latest, the Steady State GAapsp has found a population of 
weight-minimal paths. In other words, the random variable T„ dominates the optimization 
time of the Steady State GAapsp- Thus, in order to show Theorem [21 it is sufficient to 
show that, for every pc G (0, 1), there exists an positive absolute constant C := C{pc) such 
that 

Pr [Tn > Cn^ log n] < i. (3) 
In fact, we show an even stronger statement, given in the following proposition. 

Proposition 8. For every pc € (0,1), there exists an positive absolute constant Ci : = 
Ci(pc) such that, 

Pr [Tk > Tk-i + Ci(2/3) V logn + l] < (4) 

holds for all k & {1, . . . ,n} . 

Before we prove Proposition [HI let us first argue how it implies ([3]) and thus Theorem [2l 
To this end, consider the telescopic sum 

n 

T„ = To + J](Tfc-Tfc_i). 

fc=i 

The random variable To denotes the first point in time such that the population contains a 
weight-minimal path for all pairs {u, v) which have a weight-minimal path of length a(0) = 
1. But such a path of length 1 consists only of a single (directed) edge, and for all vertex 
pairs that form such an edge this edge is present in the initial population (Step 1 in 
Algorithm [1]) . Therefore, we always have Tq = 0. Next, suppose that 

Tfc-Tfc_i<Ci(2/3)Vlogn + l 

holds for all /c G {1, . . . , n}. In this case, we have 

n 

Tn < J^(2/3)'=)n^logn + n < 2n^\ogn + n, 

k=l 

where we bounded ^fc=i(2/3)^ by the geometric sum 

oo 

E(2/3)' < 2. 

k=l 

Thus, if we set C := 2Ci + 1, then the event 

" Vfc G {1, . . . , n} : Tfc - T^-i < C^{2/?>frv' logn + 1 " 
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implies the event " T„ < Crv^ log n " . Conversely, this means that the event " T„ > 
Cn^logn" implies the event 

"3A;e {l,...,n}: T, - Tfe_i > Ci(2/3) Vlogn + 1 ". 

Therefore, we have 

Pr [r„ > Cn^Xogn] < Pr [3k G {1, . . . , n} : - T^.i > 1 + Ci(2/3) V logn] 

and ([3D follows from Proposition |H] by the Union Bound (see pQ). 

In order to conclude the proof of Theorem [21 we still need to show Proposition [HI We 
devote the remainder of this section to this proof. 

Proposition [8] basically states that with sufficiently high probability, the time needed by 
the Steady State GAapsp between the {k — l)-th stage and the k-th stage is not too long. 
Interestingly, for the version of the Steady State GAapsp we are considering in this section 
(the one with feasible parent selection), it is sufficient to regard only the effect of crossover 
and to neglect the effect of mutation (which can only be beneficial to the analysis) on the 
optimization process. 

In our analysis, we strongly rely on the following considerations. If we perform (feasi- 
ble) parent selection and subsequent crossover on a population which contains all weight- 
minimal paths of length at most a{k — 1), then it is sufficiently likely to generate each 
weight-minimal path of length at most a{k) = (3/2)a(fc — 1). Now, the crucial observation 
is that this argument remains valid if the population contains weight-minimal paths of any 
length for every pair of vertices that has a weight-minimal paths of length at most a{k — 1) 
in the graph G. Of course, in this case we cannot guarantee the Steady State GAapsp pro- 
duces every weight-minimal path of length at most a{k). However it will instead produce 
some weight-minimal path for every pair of vertices that has a weight-minimal paths of 
length at most a{k) in the graph G. This observation is formalized in the following lemma. 

Lemma 9. For every a G with a> 1 and every pair {u,v) G ^^^3/2)0 \ > there exists 
at least a/4 different vertices x eV such that {u,x) and {x,v) are in and such that the 
concatenation of every weight-minimal u-x -path with every weight-minimal x-v -path at the 
vertex x results in a weight-minimal u-v-path. 

Proof. Let the vertex pair (m, v) be in the set V^|/2)a '^^^ ^a- definitions 

of and V^^j^-^^, there exists a weight-minimal u-w-path P^^v = {u = Uq.Ui . . . ,Ui = v) of 
length ^ with 

a < £ < (3/2)a. 

Consider the index set 

J:= {ra/2l,...,LaJ}. 

Let j E J and x := uj. Note that [aj < a < i and therefore uj is a well-defined vertex 
of P„^„. By Corollary[U the two paths Pu^x = {u = Uq, . . . ,Uj = x) and Px,v{.x = Uj, . . . ,Ui = 
v) are weight-minimal, too. Moreover, both paths are of length at most a since j < a and 
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also £ — j < (3/2)a — a/2 = a. Hence, {u,x) and {x,v) are in V^. Furthermore, again by 
Lemma El the concatenation of any two weight-minimal paths ^ and P^. ^ (of arbitrary 
length) at the vertex x is weight-minimal, too. Now, all that is left to do is to bound the 
number of possible choices for x. Since 

{1 - 1 + 1 > a/4 if 1 < a < 2, 

2-2+ 1> a/4 if2<a<4, 
a - 1 - (a/2) - 1 + 1 > a/4 if a > 4, 

there are at least a/4 ways to choose x, which concludes the proof of this lemma. □ 
Finally, with Lemma |9] at hand, we are ready to prove Proposition |8] which will also 
conclude the proof of Theorem [2l 

of Proposition\^ Let k E {1, . . . , n}. We show that (jl]) holds, that is, we show that there 
exists a positive absolute constant Ci := Ciijpc) (to be chosen later) such that the event 

"T, > Tfc_i + 1 + Ci(2/3) Vlogn" (5) 

occurs with probability at most l/rt^. 

At time Tfc_i, which marks the beginning of the fc-th stage, we have for every pair of 
vertices in a weight-minimal path in the population of the Steady State GAapsp- 

Now, we want to bound the duration of the fc-th stage, that is, number of iterations needed 
until the population also contains a weight-minimal path for every pair of vertices in V^^^f^-^ . 

As a consequence Lemma [9], we will see below that in each iteration of the fc-th stage 
the probability that the Steady State GAapsp produces a weight-minimal path for a pair 
\ least {1 /&){?>/ 2)^ p ■ Thus, informally speaking, we would expect 

a kind of Coupon Collector process to happen, which produces all such weight-minimal 
paths (there are at most n^) in an expected number of 6p~^(2/3)'^r;,^ logn^ iterations. 

However, since we only have lower bounds on the probabilities to find a pair in V^^(^) \ 
'^a{k-i) ^^"^ since the events we regard are not independent of each other, we have to be 
more careful in bounding the probability of the event that ([5]) holds. For this reason, we 
apply Lemma El 

In the notation of LemmaEl let I = Va(k-i)\^a{k) v) with t G N and (m, v) E I 

denote the event that at time Tk-i + 1 + 1 there exists a weight-minimal u-v-paih in the 
population of the Steady State GAapsp- Then, we show that there exists a p G (0, 1) such 
that 

^k^))| n 4^] (6) 

0<s<t 



Pr 



holds for alH G N and (m, v) G /. 

For this, first note that since {u, v) is not in V^^(^), with positive probability there is no 
weight-minimal w-f-path in the population of the Steady State GAapsp at time T^-i + 1. 
Therefore, the conditional probability above is well-defined. 

Next, since there exists a weight minimal-path for every vertex pair in V]j^(fc_i), Lemma[9l 
gives us that there are at least a{k — l)/4 distinct vertices x E V such that {u, x) and (x, v) 
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are in V^^(;j_]^) and the concatenation of a weight-minimal w-x-path with a weight-minimal x- 
v-path yields a weight-minimal u-v-path. For each of these vertices x, the probability that 
the Steady State GAapsp performs this particular concatenation at time Tk-i -|- 1 + t is the 
probability to (i) perform a crossover step, (ii) choose the weight-minimal u-x-paih as first 
parent, and (iii) then the weight-minimal x-f-path as second parent from the population 
at time Tjt_i + t. The event (i) has probability Pc- By the definition of the operator 
"Feasible Parent Selection" (Algorithm [3]) event (ii) happens with probability at least l/ra^ 
and event (iii) with probability at least 1/n. Moreover, these bounds are independent of 
the event whether or not there already exists a weight-minimal u-v-path in the population 
at time Tk-i + 1. Thus, Equation ([6]) indeed holds for all {u, v) e / and t eN with 

a{k - l)p, (3/2) V 

P ■■= 



Finally, since the event that at time T^-i + 1 + 1 one of the events v)^ ■ ■ ■ ^ ^[u v) 
occurred for all {u, v) E I implies the event that < T^-i + 1 -|- t, we have by Lemma |6] 
with r := Ci{2/3)''n^\ogn and Ci := ^ that 

f3/2)'^r) 24 

pr = ^ ' V (2/3)Vlogn = 41ogn > 41nn 

and therefore, since I < that 

Pr [n > Tfc_i + 1 + ^1(2/3)'^=^ logn] < l/le-^^^^" < \. 
This concludes the proof of Proposition [S] and therefore also the proof of Theorem [2J □ 



5.2 The Proof of the Runtime Bound for Crossover with Repair 

We now prove Theorem [TJ Again for the sake of brevity, whenever we refer to the Steady 
State GAapsp in this section, we now assume without further mentioning that the operator 
"Crossover with Repair" is applied. The proof will follow the same line of argument like that 
in the previous section. That is, we again want to analyze the behavior of this algorithm 
in stages. However, this time we say the Steady State GAapsp has completed the k-th 
stage if the population contains a weight-minimal w-f-path for every pair of vertices {u, v) 
for which there exists such a weight-minimal path of length at most h{k) in the graph G, 
where the sequence {h{k)}k(z.^ is now given by 

h{k) ■.= 2A(?,/2f{n\ognY'^ (7) 

for all G N. Notice the extra 24(?T,logr;,)^/^-factor in the definition above compared to 
the definition of a{k) in ([2]). 

Analogously to the previous section, we let the random variable mark the end of 
the fc-th stage. Consequently, the following definition differs from Definition [TO] only in the 
choice of the sequence {6(fc)}fcgN instead of the sequence {a{k)}kizf^. 
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Figure 2: Pui,u is an approximating path for the vertex pair {u,v) with a gap of g := 
max{i, i — j}. 

Definition 10 (Time T^). For k G N, let b{k) be as defined in Ij^ and let be the 
random variable that denotes the first point in time such that, for all pairs {u,v) G V^^j^y 
the population of the Steady State GAapsp contains a weight-minimal u-v-path. 

In the previous section, we saw that the Steady State GAapsp applying the Feasible 
Parent Selection operator can find all minimum- weight shortest paths necessary to complete 
the current stage by crossover only. For the Steady State GAapsp which we consider in this 
section (the one applying the operator "Crossover with Repair"), our analysis is slightly 
more involved since it includes both, crossover and mutation. 

To capture the interplay between crossover and mutation, we divide all stages into two 
phases, which we call the crossover phase and the mutation phase. Like before with the 
stages, the distinction of phases are merely a method of analysis and do not change the 
definition of Algorithm [TJ This means that in our analysis of the optimization behavior of 
the Steady State GAapsp we will only consider the effect of crossover during the crossover 
phases and only the effect of mutation during the mutation phases. In the actual run of 
the algorithm, however, both crossover and mutation are likely to happen in all phases. 
Still, since both operators never remove a weight-minimal path from the population, we 
are save to ignore them in our analysis. 

In order to define the two phases, we now define the notions of gaps and approximating 
paths, two concepts which were introduced in [11]. The key observation behind these 
notions is that it suffices that crossover finds a path that sufficiently well approximates a 
sought-after path, because mutation is fast enough to fill the gaps. 

Definition 11 (Approximating Path with Gap g). Let P^^, = {u = uq,ui, . . . ,ue = v) 
be a weight-minimal path in G of length i. Suppose that P^^y is a path in G with x = Ui 
and y = Uj for some indices < i < j < i. Then we call P^^y an approximating path for 
the pair {u,v) with (integral) gap g := max{i,£ — j}; compare FigurelM 

Notice crucially that it is not necessary that an approximating path for the pair {u, v) 
approximates the weight of a weight-minimal -u-f-path. In particular, every u-v-paih is an 
approximating path with gap for the vertex pair {u,v). 

With the notion of approximating paths at hand, we define the two phases of the k- 
th stage. The first phase, which we call the crossover phase, start with the beginning 
of the A;-th stage (at time T^_i) and end when for all pairs {u,v) G V^j^^, there exists a 
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weight-minimal approximating path for {u, v) with gap at most 

g{k) := {b/Gfinlogny/^ (8) 

for all /c G N. 

The second phase, the mutation phase, lasts from the end of the crossover phase to the 
end of the k-th stage. Corresponding to T^, we define the random variable Tj! which marks 
the end of the crossover phase and the beginning of the mutation phase. 

Definition 12 (Time T^')- Fork eN with k>l, let g{k) be as defined m ^ and let Til 

he the random variable that denotes the first point in time after Tl_^ such that, for all 
pairs {u,v) G V^f^j^y the population of the Steady State GAapsp contains a weight-minimal 
approximating path for the vertex pair {u, v). 

It is clear by Definition [10] and Definition [12] that 

To < T[' < T[ <T!^ <T!^< ... 

holds and that some of these inequalities may happen to be tight. Like in the previous 
section, it still holds that at time T^, the Steady State GAapsp has found a population of 
weight-minimal paths. This time, however, we bound the number of phases more careful. 
Let k* := k*{n) be the minimum integer k such that h{k) > n — 1. Then already at time 
T^. the Steady State GAapsp has found a population of weight-minimal paths. It is easy 
to see that 

k* < 2 logn. 

In order to show Theorem [T] it is again sufficient to show that, for every p^ G (0, 1), 
there exists an positive absolute constant C := C{pc) such that 

Pr [U^ > Cn3(n logn) 1/5] < ^. (9) 

Again, we show a stronger statement. The following proposition is the direct counterpart 
to Proposition [8] in the previous section. However, this time we regard both, crossover 
and mutation. Both operators have constant probability to be applied in an iteration, and 
neither can decrease the fitness of an individual. Hence we may occasionally only regard 
the effect of one of the two. 

We start with the consideration of the effects of mutation only. We apply a result 
in [9] to arrive at a population that contains with high probability a weight-minimal path 
for every vertex pair for which there exists a weight-minimal path in G of length at most 
24(n logn) 1/5. This marks the end of the Oth-stage. 

The duration of this initial stage is comparable to the duration of all next stages, during 
which the interplay between crossover and mutation plays a role. From this point on we 
consider the effects of the repair-crossover which allows for the creation of approximating 
weight-minimal path in the crossover phase of a stage. In the subsequent mutation phase, 
the mutation operator will close the gap between the approximating weight-minimal paths 
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and the weight-minimal paths we are actually aiming for (again, we do not give a full proof 
for this but refer to [9] instead). The following proposition gives a precise statement of 
these observations for the initial mutation stage (Equation [T0|) . the kth crossover phase 
(Equation [TTj) and the kth mutation phase (Equation [T2l) . 

Proposition 13. For every pc € (0, 1), there exists three positive absolute constants Ci : = 
Ci(pc); C2 '■= C2{pc), and C3 := C^i^pc) such that the three inequalities 

Pr [T^ > Cin^nlogn^/^ + Cin^logn] < (10) 

Pr [t; - > C2(4/5)^V(nlogn)i/^ + ^] ^ ^^^^ 

Pr \T' - T' > C3(5/6)V(nlogn)i/^ + Csn^logn] < ^ (12) 

n 

hold for all k e {1, • • • , k*}. 

As before, we defer the proof of Proposition [13] until we have shown how Proposition [13] 
implies (JH]) and thus Theorem [T] This follows exactly the same line of proof as in the 
previous section. We consider the telescopic sum 

TL = U + Y.{n- no + (r; - TU). 

k=l k=l 

and note that this time we do not necessarily have Tq = due to the initial stage. Consider 
the event that the three inequalities 

To < Cm^inhgnY^^ + CinHogn, 
n - TLi < C2{4/5)V{n\ognY/' + 1, 

Tfe - Tj! < C3{5/6)''n%n log n)^/^ + Cgn^ log n 

hold for all A; G {1, . . . , /c*}. If this event occurs, then we have 

k* k" 

Y.^T;:-TU) < ($^(4/5)^^)C2n^(nlogn)i/^ + F, 

k=l k=l 

k* k* 

Y^iU-n) < (5^(5/6)'=)C3n3(nlogn)V5 + C3rn3logn. 

k=l k=l 

Substituting k* <2 log n and bounding the geometric series' yields 

Ti < {Ci + (16/9)^2 + 5C3)n=^(nlogn)i/^ + C3n=^(logn)2 + 21ogn 
There exists a positive absolute constant D such that 

C3n^(logn)^ + 21ogn < Dn^{n\ognY^^ 
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holds for all n E N. Then, for C := 2 + Ci + (16/9)C2 + 5C3 + D, we apply the same union 
bound argument as in the previous section. Note that we consider the union of 2k* + 1, that 
is, of at most n events. Thus (Q follows from the three inequalities ffTOjl . f lTTj) . and f lT2|) 
in Proposition [131 Since inequality ([9]) implies Theorem [H we may conclude the proof of 
Theorem [1] by showing Proposition [T3l 

To derive the two inequalities f lTU]) and f lT^ in Proposition [T31 we apply a result from [H], 
which we do not prove here but simply state as Lemma [HJ We adapt the notation to our 
needs, a closer look into the proofs in |9j shows that this is easily possible. 

Lemma 14 (Analysis of Mutation). For all pc G (0,1) there exists a positive absolute 
constant C := C(pc) such that the following statement holds. 

Let g G and let W <^V^. Suppose that at time to G N the population of the Steady 
State GAapsp contains a weight-minimal approximating path with gap at most g for every 
vertex pair {u,v) G W. Let T be the random variable that denotes the first point in time 
such that the population of the Steady State GAapsp contains a (proper) weight-minimal 
path for every vertex pair {u, v) E W . Then 



Prob 



T > to + Cn%g + \ogn) 



1 

< — . 



The two inequalities ( ITOj) and ffT2|) in Proposition [13] directly follow from the previous 
lemma. 

of inequality / HOj) in Proposition [731 Let Ci := 24C where C := C{jpc) is the positive ab- 
solute constant provided by Lemma [TH Then inequality (11 01) follows from Lemma \\M if we 
set g := 6(0), W := V;^^^), and to = 0. □ 

of inequality [W\) in PropositionUM Let k G {l,...,k*} and let Ci := C 
where C := C{pc) is the positive absolute constant provided by Lemma [141 Then 
inequality (IT^ again follows from Lemma [TH if we set g := g{k), W := V^^^j, and replace 
to by the random variable T^_i (we may do this by the law of total probability, since 
Lemma [HI holds for every choice of to). □ 



At this point, we are only left to prove inequality f[TT]) in Proposition [T31 This is 
done similarly to the proof of Proposition [81 and will be the remainder of this section. 
In particular, we next show a lemma which is the direct counterpart to Lemma M in the 
previous section. 

The following lemma gives a lower bound on the number of different combinations how 
the Steady State GAapsp can produce a weight-minimal approximating path with gap g{k) 
for any weight-minimal path of length at most b{k) = {3/2)b{k — 1). Again, we will assume 
that for every weight-minimal graph of length at most b{k — 1) there exist a weight minimal 
path (of arbitrary length) in the population of the Steady State GAapsp at that time. 

Lemma 15. For every b G with b > 24, every g G M"*" with g < 6/12, and every pair 
{u,v) G V(\/2)b\^b ' there exists at least (6(7)^/36 different tuples {u',x,y,v') of four distinct 
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approximating path 



< b 



gap-path 



< b 



< b 



X > 



<y 



repair-path 



Ui e I Uf e F 



gap-path 



Uh & H Uj e J 



Figure 3: The path depicts the situation of Lemma [T^ with two possible gap paths and a 
repair path from x to y. The a;-?/-repair-path stems from the paths from u' to x and from y 
to v' which are input of the repair-crossover. Each of these three paths has length at most b 
and their concatenation is a approximating path with gap g for the vertex pair {u,v). 



vertices u', x, y,v' E V such that the pairs («', x), {x, y), and {y, v') are in and such that 
the concatenations of every weight-minimal u'-x-path and every weight-minimal y-v'-path 
with every weight-minimal x-y-path at the vertices x and y results in a weight-minimal 
u'-v'-path which is an approximating path with gap at most g for the pair {u,v); compare 
Figure 0. 

Proof. Let the vertex pair {u, v) be in the set V^^2)b ^'ut not in the set V^. By the definitions 
of and V'(|/2)6' there exists a weight-minimal u-v-paih P^^v = {u = Uq,Ui . . . ^u^ = v) of 
length £ with 

6 < £ < (3/2)6. 

Consider the four index sets 

/:= {0,...,L^7j}, 

F:={[6/2l,...,[6/2l + rV6l-l}, 
H:= { \b/2\ + [6/6] , . . . , \b/2\ + 2[6/6l - l}, 
J:= {l-[g\,...j]. 

Since g < b/12 and b > 24, we have that 

g < b/12 < {b/12) + 1 < \b/2], 

and, since i > b, we have that 

\b/2] + 2 \b/Q] - 1 < 5b/Q + 2<b- {b/12) <e-[g\, (13) 

the index sets /, F, H, and J are disjoint and subsets of {0, . . . , £}. Let i E I, f E F, h E H, 
and j E J. Furthermore, let u' := Ui, x := Uf, y := Uh, and v' := Uj. Then, by repeated 
application of Lemma [3] and Corollary HI the three paths Pu',x = {u' = Ui, . . . , u j = x) , 
Px,y = {x = Uf, . . . ,Uh = y) and Py^^' = {y = Uh, . . . ,Uj = v') are weight-minimal, too. 
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Moreover, all three paths are of length at most b since, by fll3p . we have both f < h < b 
and h ~ f < h < b. Furthermore, as in the proof of Lemma M, we have also also i — h < b. 
Hence, {u',x), {x,y), and {y,v') are in V^. Furthermore, again by repeated application 
of Lemma [3], the concatenation of any three weight-minimal paths -P^/ j., -P^ ^ and P^ „, (of 
arbitrary length) at the vertices x and y is weight-minimal, too. Finally, there are 

|/| ■ |F| ■ \H\ ■ \J\ = {[g\+ lf\b/6Y > 

ways to choose the tuple {u',x,y,v'), which concludes the proof of this lemma. □ 
Finally, all we are left to do is to prove of inequality ffTTj) in Proposition [131 As 
announced above, this proof will again follow the lines of the proof of Proposition |8] in the 
previous section. 

of inequality 07]) in Proposition [7^1 Let k G {1, . . . , k*}. We show that fllip holds, that 
is, we show that there exists a positive absolute constant C2 ■= C2{pc) (to be chosen later) 
such that the event 

"T; > T^_i + 1 + C2(4/5)2V(nlogn)0-2" (14) 

occurs with probability at most l/n^. 

At time T^_i, which marks the beginning of the /c-th stage, we have for every pair of 
vertices in V^^^_^-^ a weight-minimal path in the population of the Steady State GAapsp- 
Now, we want to bound the duration of the crossover phase of the fc-th stage, that is, num- 
ber of iterations needed until the population also contains a weight-minimal approximating 
path with gap at most g{k) for every pair of vertices in V^^^y 

We again apply Lemma |6l let / = V^^^^_^-^ \ V^^^^-^ and let A*^ ^-^ with t G N and {u, v) & I 
denote the event that at time Tl__^ + l + t there exists a weight-minimal approximating path 
with gap at most g{k) for the pair {u^v) in the population of the Steady State GAapsp- 
We show that there exists a p G (0, 1) such that 



Pr 



0<s<t 



holds for all t G M and {u, v) G /. 

For this, first note that since {u,v) is not in V^f^f^y with positive probability there is no 
weight-minimal w-f-path in the population of the Steady State GAapsp at time T^_]^ -|- t. 
Therefore, the conditional probability above is again well-defined. 

Next, since there exists a weight minimal-path for every vertex pair in V"jj^^_^-), Lemma [T5] 
gives us that there are at least (6(fc — l)g{k))^/36 distinct tuples {u',x,y,v') of ver- 
tices u', X, y,v' E V such that («', x), {x, y), and {x', v) are in V^^^.^) and the concatenation 
of a weight-minimal u'-x-path, a weight-minimal x-y-paih, and a weight-minimal y, f '-path 
at X and y yields a weight-minimal w'-f '-path which approximates the pair {u, v) with gap 
at most g{k). Note that, in order to apply Lemma fT5| we need here that b{k — 1) > 24, 
which holds since 

b{k - 1) > b{0) = 24(nlogn)°-2 
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and that g{k) < h{k — 1)/12, which holds since 

9{.k) < g{Q) < 246(0) < h{k - 1)/12. 

For each of these vertex tuples {u',x,y,v'), the probability that the Steady State 
GAapsp performs this particular concatenation at time T^_i + 1 + t is the probability to (i) 
perform a crossover step, (ii) choose the weight-minimal u'-x-path as first parent, and (iii) 
then the weight-minimal y-v'-paih as second parent from the population at time T^_i + 1. 
The event (i) has probability pc- By the definition of the operator "Crossover with Repair" 
(Algorithm [2]) event (ii) happens with probability at least 1/rP and event (iii) happens 
again with probability at least Moreover, these bounds are independent of the event 

whether or not there already exists a weight-minimal approximating path of gap at most g 
for the pair (u, v) in the population at time Tj^^-^ +t. Thus, Equation f|T5l) indeed holds for 
all {u, v) e I and t eN with 

{b{k-l)g{k))'^pc _ (5/4)2^=^,(71 log n) 

Finally, since the event that at time T^.^^ + 1 + 1 one of the events A^^ v)^ ■ ■ ■ ^ v) 
occurred for all {u,v) G / implies the event "T^' < Tj^^^ + 1 + 1", we have by Lemma [6] 
with r := C2{4:/5yV{nlogn)°-'^ and Ca := ^ that 

(5/4)2'=p,(nlogn)4/5 324 ,,,2fc 3. , ^02 
pr = ^ ' ' \ (4/5)2V(nlogn)°-2 = 41ogn > 41nn 

and therefore, since I < that 

Pr It;; >T;_, + 1 + C2{4:/5y''n%nlognY/''] < |/|e-4i°" < ^. 
This concludes the proof of Proposition [13] and therefore also the proof of Theorem [H □ 



6 Discussion 

Apart from the results rigorously proven we conjecture that the bounds are actually tight 
and cannot be improved by a better analysis of the process. 

In the following, we want to discuss how our runtime bounds can be extended to other 
but linear weight functions. For this, notice that the only place in the proofs of our runtime 
bounds where we refer to the actual properties of the weight-function is Lemma [31 In other 
words, if a fitness function / on all paths of a graph G satisfies Lemma [3l then our upper 
runtime bounds also hold for /. This motivates the following definition. 

Definition 16. Let G be a finite, strongly connected directed graph and let f be a non- 
negative fitness function on the set of all paths in G. Then f is called subpath optimal if 
the following holds. 
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If Pu,v = {u = Uq, . . . ,U£ = v) is an f- optimal u-v -path of length i andx = Ui andy = uj 
are vertices of Pu^v with < i < j < i, then substituting the f -optimal subpath P^^y with 
another f -optimal path P'^y yields an f -optimal path P'^^- 

Substituting a subpath ^ of Pu^^ amounts to the concatenation of the three paths Pu^x 
with P'^ y and Py^^ at the vertices x and y (where one path can possibly be empty). Recall, 
that by design our crossover operators used in the Steady State GAapsp concatenate two 
or three paths to derive Pu,v As in our analyses we ignored the ability of mutation to 
shrink paths by deleting edges, we regard the application of mutation as a concatenation 
of two paths. 

We can now apply Lemma [3] to any weight-function with non-negative weights that 
is subpath optimal. This worked for the subpath optimal fitness function associated with 
APSP that maps paths to their lengths. Another subpath optimal example is mapping 
paths to the weight of their lightest edge (and maximize); this is known as the all-pairs 
bottleneck paths problem (see [TBI EZ]) and has applications for example in voting theory 
|33] . Lemma E] is also applicable because if the minimum increases on a subpath then the 
overall minimum of the path is never decreased. 

Thus, as a direct corollary to Theorem [H the Steady State GAapsp with with repair 
on the all-pairs bottleneck paths problem has an optimization time of 0(n^'^(log ra)"'^) 
iterations with high probability; and as a corollary to Theorem |21 we have that the Steady 
State GAapsp with feasible parent selection on the all-pairs bottleneck paths problem has 
an optimization time of 0{n^\ogn) iterations with high probability. 

As a final remark we would like to relate the success of our GA to dynamic programming. 
Our analysis is oriented at the Bellman-Ford Algorithm, and in particular the stages we 
consider are basically the same stages that occur there. From this perspective, one may 
say that the Steady State GAapsp mimics the optimization behavior of the Bellman-Ford 
Algorithm. 

Still, the representation of individuals and the particular diversity mechanism used is 
natural since we would like to compute shortest paths for each pair of vertices. We see 
that the randomness introduced by the selection and crossover operators in the Steady 
State GAapsp raises subtle points in the analysis of our runtime bounds which are not 
present in that of the Bellman- Ford Algorithm. Two examples are the necessary overlap of 
the optimal subpaths in crossover and, in the case of Crossover with Repair, the interplay 
between mutation and crossover. 

In a more general setting, one may ask whether our findings relate to other problems 
that have dynamic programming algorithms. The answer to this again relates to Lemma E] 
and Definition [ini If the problem structure and representation within a GA is such that 
with sufficiently large probability the crossover operator can combine two optimal solutions 
like in Definition [161 we may hope that a GA using crossover and mutation outperforms a 
GA using mutation only. However, we are not aware of any concrete examples of such opti- 
mization problems, except for the APSP problems with subpath optimal fitness functions. 
We refer the reader to [0] for a discussion of how optimization problems which are accessible 
to dynamic programming can be solved by evolutionary algorithms with mutation only. 
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7 Conclusions 



We have shown how the use of repair mechanisms or appropriate selection strategies can 
speed up crossover-based evolutionary algorithms for the all-pairs shortest path problem 
(and some other problems with a similar structure). However, it remains a challenge to 
understand the usefulness of crossover in evolutionary computation in a rigorous way for 
other combinatorial optimization problems. 

The evolutionary algorithm examined for the APSP problem makes use of a strong 
diversity mechanism (each individual represents a path between a different pair of vertices) 
that allows to show the usefulness of crossover. Often evolutionary algorithms use much 
weaker diversity mechanism such as niching, deterministic crowding and fitness sharing and 
the goal is to compute just a single solution instead of a set of solutions. We state it as an 
open problem to show that crossover speeds up evolutionary algorithms for single-objective 
combinatorial optimization using one of the stated diversity mechanisms. 

On the other hand, multi-objective problems use in a natural way a diversity mech- 
anisms according to Pareto dominance. Often the population of an evolutionary algo- 
rithm for mult i- objective optimization contains a population which represents the different 
tradeoffs with respect to the given objective function at a certain time step. It would be 
interesting to have rigorous results that show the usefulness of crossover in evolutionary 
multi-objective optimization. 
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